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ABSTRACT The green fluorescent protein (GFP) of the 
jellyfish Aequorea victoria is an unusual protein with strong 
visible absorbance and fluorescence from a />-hydroxybenzyli- 
dene-imidazolidinone chromophore, which is generated by 
cyclization and oxidation of the protein's own Ser-Tyr-Gly 
sequence at positions 65-67. Cloning of the cDNA and heter- 
ologous expression of fluorescent protein in a wide variety of 
organisms indicate that this unique posttranslational modifi- 
cation must be either spontaneous or dependent only on 
ubiquitous enzymes and reactants. We report that formation of 
the final fluorophore requires molecular oxygen and proceeds 
with a time constant (~4 hr at 22°C and atmospheric p0 2 ) 
independent of dilution, implying that the oxidation does not 
require enzymes or cofactors. GFP was mutagenized and 
screened for variants with altered spectra. The most striking 
mutant fluoresced blue and contained histidine in place of 
Tyr-66. The availability of two visibly distinct colors should 
significantly extend the usefulness of GFP in molecular and cell 
biology by enabling in vivo visualization of differential gene 
expression and protein localization and measurement of pro- 
tein association by fluorescence resonance energy transfer. 



Proteins are often labeled with fluorescent tags to detect their 
localization and sometimes their conformational changes 
both in vitro and in intact cells. Such labeling is essential both 
for immunofluorescence and for fluorescence analog cy- 
tochemistry, in which the biochemistry and trafficking of 
proteins are monitored after microinjection into living cells 
(1). Traditionally, fluorescence labeling is done by purifying 
proteins and then covalently conjugating them to reactive 
derivatives of organic fluorophores. The stoichiometry and 
locations of dye attachment are often difficult to control, and 
careful repurification of the proteins is usually necessary. If 
the proteins are to be used inside living cells, a final chal- 
lenging step is to get them across the plasma membrane via 
micropipet techniques or various methods of reversible per- 
meabilization. 

An alternative would be to devise molecular biological 
means to generate fluorescent proteins. The natural UV 
fluorescence of tryptophan residues is of little use except in 
the simplest in vitro samples, because the wavelengths are 
too short and tryptophan is too ubiquitous for any one protein 
to stand out in a complex biological mixture. A more prom- 
ising strategy would be to concatenate the gene for the 
nonfluorescent protein of interest with the gene for a natu- 
rally fluorescent protein and express the fusion product. The 
most highly fluorescent proteins known are the phycobilip- 
roteins (2), but their fluorescence depends entirely on correct 
enzymatic insertion of a difficult-to-obtain tetrabilin chro- 
mophore into a large apoprotein (3). The green fluorescent 
protein (GFP) from the jellyfish Aequorea victoria is a much 
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smaller molecule of 238 amino acids, whose natural function 
seems to be to convert the blue chemiluminescence of the 
Ca 2+ -sensitive photoprotein aequorin into green emission (4). 
GFP's absorption bands in the blue and emission peak in the 
green do not arise from a distinct cofactor but rather from an 
internal p-hydroxybenzylideneimidazolidinone chro- 
mophore generated by cyclization and oxidation of a Ser- 
Tyr-Gly sequence at residues 65-67 (5). When the gene for 
GFP was first cloned (6), whether jellyfish-specific enzymes 
were required for this posttranslational modification was 
unknown. Heterologous expression of the gene in Esche- 
richia co//(7, 8), Caenorhabditis elegans (7), Saccharomyces 
cerevisiae (R.H., S. D. Emr and R.Y.T., unpublished data), 
and Drosophila meianogaster (9) showed that additional 
Aequorea-spccific enzymes were not required because the 
protein became brightly fluorescent in all these organisms. 

The ability to generate fluorescence in situ by expressing 
the gene for GFP has opened up tremendous possibilities for 
continuously monitoring gene expression, cell developmen- 
tal fates, and protein trafficking in living, minimally perturbed 
cells, tissues, and organisms. Nevertheless, major questions 
about GFP itself remain. What is the mechanism of fluoro- 
phore formation? How does fluorescence relate to protein 
structure? Can its fluorescence properties be tailored and 
improved — in particular, to provide a second distinguishable 
color for comparison of independent proteins and gene ex- 
pression events? This study provides initial answers. 

MATERIALS AND METHODS 

The coding region of a clone of A. victoria gfplO cDNA (6) 
was amplified by the PCR (7) to create Nde I and BarnHl sites 
at the 5' and 3' ends, respectively, and cloned behind the T7 
promoter of pGEMEX-2 (Promega), replacing most of the T7 
gene 10. The resulting plasmid was transformed into E. coli 
strain JM109(DE3), and high-level expression was achieved 
by growing the cultures at 24°C to saturation without induc- 
tion by isopropyl 0-D-thiogalactoside. Random mutagenesis 
of the gjp cDNA was done by hydroxylamine treatment (10) 
or by increasing the error rate of the PCR with 0.1 mM MnCl 2 , 
50 fM dATP, and 200 fjM of dGTP, dCTP, and dTTP (11). 
The product was ligated into pGEMEX-2 and subsequently 
transformed into strain JM109(DE3). Colonies on agar were 
visually screened for different emission colors and ratios of 
brightness when excited at 475 vs. 395 nm, supplied by a 
xenon lamp and grating monochromator for which the output 
beam was expanded to illuminate an entire culture dish. 

To prepare soluble extracts, cells from a 1.5-ml suspension 
were collected, washed, and resuspended in 150 fi\ of 50 mM 
Tris-HCl, pH 8.0/2 mM EDTA. Lysozyme and DNase I were 
added to 0.2 mg/ml and 20 ug/ml, respectively, and the 
samples were incubated on ice until lysis occurred (1-2 hr). 



Abbreviation: GFP, green fluorescent protein. 
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Fig. 1. GFP expression in E. coli: Variations in fluorescence 
properties despite equalized apoprotein contents. Different versions 
of GFP are compared by gel electrophoresis and Coomassie blue 
staining. Soluble extracts of E. coli expressing GFP show a predom- 
inant band that is absent in extracts from control cells and has the 
same electrophoretic mobility as native GFP isolated from the 
jellyfish A. victoria (5). Inclusion bodies of expressing cells consist 
mainly of nonfluorescent GFP, which has the same mobility as 
soluble GFP. Nonfluorescent soluble GFP of anaerobically grown 
(anerob) cultures is also a major band vyith correct mobility. Soluble 
extracts of the mutated clones H9 (S202F, T203I), P9 (I167V), Pll 
(I167T), and P4 (Y66H) again contain a dominant protein with 
essentially the same molecular mass. 

The lysates were then clarified by centrifuging at 12,000 x g 
for 15 min. Anaerobic cultures were grown in GasPak 
pouches (Becton Dickinson). Inclusion bodies were obtained 
as described (12). 

Excitation and emission spectra were measured with 
1.8-nm band widths, and the nonscanning wavelength was set 
to the appropriate peak. Excitation spectra were corrected 
with a rhodamine B quantum counter, whereas emission 
spectra (except for mutation Y66W) were corrected for 
monochromator and detector efficiencies by using manufac- 
turer-supplied correction spectra. 

RESULTS AND DISCUSSION 

GFP was expressed in E. coli under the control of a T7 
promoter for quantitative analysis of the properties of the 
recombinant protein. Gel electrophoresis under denaturing 
conditions showed protein of the expected molecular mass (27 
kDa) as a dominant band (Fig. 1), which could be quantified 
simply by densitometry of staining with Coomassie blue. 
Soluble recombinant GFP proved to have identical spectra and 
the same or even slightly more fluorescence per mol of protein 
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as GFP purified from A. victoria, showing that the soluble 
protein in E. coli undergoes correct folding and oxidative 
cyclization with as high an efficiency as in the jellyfish. The 
bacteria also contained inclusion bodies consisting of protein 
indistinguishable from jellyfish or soluble recombinant protein 
on denaturing gels (Fig. 1). However, this material was com- 
pletely nonfluorescent, lacked the visible absorbance bands of 
the chromophore, and did not become fluorescent when 
solubilized and subjected to protocols that renature GFP (13). 
Therefore protein from inclusion bodies seemed unable to 
generate the internal chromophore. 

An interesting intermediate stage in protein maturation 
could be generated by growing the bacteria anaerobically. 
The soluble protein again looked the same as GFP on 
denaturing gels (Fig. 1) but was nonfluorescent. In this case, 
fluorescence gradually developed after admission of air, even 
when fresh protein synthesis was blocked by using puromy- 
cin and tetracycline. Evidently the soluble nonfluorescent 
protein synthesized under anaerobic conditions was ready to 
become fluorescent once atmospheric oxygen was readmit- 
ted. The fluorescence per protein molecule approached its 
final asymptotic value with a single-exponential time course 
and a rate constant of 0.24 ± 0.06 hr" 1 (at 22°C), measured 
either in intact cells with protein-synthesis inhibitors or in 
lysates in which the soluble proteins and cofactors were ^lO 5 
more dilute than in cells. Such pseudo-first-order kinetics 
strongly suggest that no enzymes or cofactors are necessary 
for the final step of fluorophore formation in GFP. A tentative 
molecular interpretation is presented in Fig. 2. If the newly 
translated apoprotein evades precipitation into inclusion bod- 
ies, the amino group of Gly-67 might cyclize onto the car- 
bonyl group of Ser-65 to form an imidazolidin-5-one, where 
the process would stop were O2 absent. The new N=C 
double bond would be expected to promote dehydrogenation 
to form a conjugated chromophore; imidazolidin-5-ones un- 
dergo autoxidative formation, of double bonds at the 4-posi- 
tion (14, 15), which would 'complete the fluorophore (Fig. 2, 
upper right). Because fluorophore formation requires at least 
one step with a time constant of M hr, use of GFP as a 
reporter protein to monitor faster changes in promoter ac- 
tivity may be problematic;: - 

A major question in protein photophysics is how a single 
chromophore can give widely different spectra depending on 
its local protein environment. This question has received the 
most attention with respect to the multiple colors of visual 
pigments based on retinal (16) but is also important in GFP. 
The GFP from Aequorea and that of the sea pansy Renilla 
reniformis share the same chromophore, yet Aequorea GFP 
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has two absorbance peaks at 395 and 475 nm, whereas Renilla 
GFP has only a single absorbance peak at 498 nm, with 
~5.5-fold greater monomer extinction coefficient than the 
major 395-nm peak of the Aequorea protein (17). The isolated 
chromophore and denatured protein at neutral pH do not 
match either native protein (5). For many practical applica- 
tions, the spectrum of Renilla GFP would be preferable to that 
of Aequorea because wavelength discrimination between dif- 
ferent fluorophores and detection of resonance energy transfer 
are easier when the component spectra are tall and narrow 
rather than low and broad. Furthermore, the longer wave- 
length excitation peak (475 nm) of Aequorea GFP is almost 
ideal for fluorescein filter sets and is resistant to photobleach- 
ing but has lower amplitude than the shorter wavelength peak 
at 395 nm, which is more susceptible to photobleaching (7). 
For all these reasons, it would be interesting to convert the 
Aequorea GFP excitation spectrum to a single peak, prefer- 
ably at longer wavelengths. The cDNA was therefore sub- 
jected to random mutagenesis by hydroxylamine treatment or 
PCR. Approximately six thousand bacterial colonies on agar 
plates were illuminated with alternating 395- and 475-nm 
excitation and visually screened for altered excitation prop- 



erties or emission colors. Although this number of colonies 
falls far short of saturating the possible mutations of a protein 
of 238 residues, interesting variants have already appeared. 
Three mutants were found with significant alterations in the 
ratio of the two main excitation peaks (Fig. 3 and Table 1). 
Compared with wild-type GFP, mutant H9 had increased 
fluorescence at 395-nm excitation, whereas mutants P9 and 
Pll were more fluorescent at 475-nm excitation, mutant Pll 
being the best species for fluorescein filters. The mutations 
were sequenced and recombined with the wild-type gene in 
different ways to eliminate neutral mutations and assign the 
fluorescence effects to single-amino acid substitutions (except 
for mutant H9, where two neighboring mutations have not yet 
been separated). The mutations all lay in the C-terminal part 
of the protein (Table 1), remote in primary sequence from the 
chromophore formed from residues 65-67. Determination of 
the three-dimensional structure of GFP should enable detailed 
interpretation of these spectral perturbations. One possibility 
is that the mutations at Ile-167 shift a positive charge slightly 
closer to the phenolic group of the fluorophore; this shift 
should both increase the percentage of phenolic anion, which 
is probably the species responsible for the 470- to 475-nm 
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GFP 




Excitation 


Emission 


Relative 


Mutation 


maxima,* nm 


maxima, * nm 


fluorescence,* % 


Wild type 


None 


396 (476) 


508 (503) 


(=100) 


Mutant H9 


Ser-202- Phe, 


398 


511 


117§ 




Thr-203 — He 






Mutant P9 


IIe-167 - Val 


471 (396) 


502 (507) 


166* 


Mutant Pll 


Ile-167 -> Thr 


471 (396) 


502 (507) 


188* 


Mutant P4 


Tyr-66 -> His 


382 


448 


57II 


Mutant W 


Tyr-66 -* Trp 


458 


480 


ND 



♦Values in parentheses are lower-amplitude peaks. 
'Primary values were seen when exciting at the main excitation peak; values in parentheses were seen 
when illuminating at the lower-amplitude excitation peak. 

*Equai amounts of protein were used based on densitometry of gels stained with Coomassie blue (Fig. 
1). ND, not done. 

^Emission maxima of spectra recorded at excitation 395 nm were compared. 
"Emission maxima of spectra recorded at excitation 475 nm were compared. 
"Emission spectrum of mutant P4 recorded at 378-nm excitation was integrated and compared with the 

integrated emission spectrum of wild type recorded at 475-nm excitation; both excitation and emission 

characteristics were corrected. 



excitation peak, and shift the emission peak to shorter wave- 
lengths. However, the hypothesized ionizable phenolic group 
would have to be buried inside the protein at normal pH 
because the ratio of 471- to 396-nm peaks in the mutants could 
not be further affected by external pH until it was raised to 10, 
just below the threshold for denaturation. The pH-sensitivity 
of wild-type GFP is similar (18). 

A fourth mutant, P4, was excitable by UV light and 
fluoresced bright blue in contrast to the green of wild-type 
protein (Fig. 4). The excitation and emission maxima were 
hypsochromically shifted by 14 and 60 nm, respectively, from 
those of wild-type GFP. The mutated DNA was sequenced 
and found to contain five amino acid substitutions, only one 
of which proved to be critical, Tyr-66 -> His in the center of 
the chromophore. The surprising tolerance for substitution at 
this key residue prompted further site-directed mutagenesis 
to tryptophan and phenylalanine at this position. Tryptophan 
gave excitation and emission wavelengths intermediate be- 
tween tyrosine and histidine (Fig. 3 and Table 1) but was only 
weakly fluorescent, perhaps due to inefficiency of folding or 
chromophore formation, whereas phenylalanine gave no 
detectable fluorescence. 

The availability of several forms of GFP with such different 
excitation and emission maxima [the most distinguishable 
pair being mutant P4 (Y66H) vs. mutant Pll (I167T)] should 
facilitate two-color assessment of differential gene expres- 




Fig. 4. E. coli producing either no GFP (Left), wild-type GFP 
(Center), or the P4 mutant (Right) in which Tyr-66 is replaced by 
histidine. Approximately equal quantities of bacteria were sus- 
pended in 1-cm cuvettes, excited by a 365-nm transilluminator, and 
photographed through a low-fluorescence filter passing wavelengths 
>450 nm. The bacteria producing no GFP were transformed with 
vector alone. Flecks of green in the two flanking cuvettes are stray 
reflections from the center cuvette. 



sion, developmental fate, or protein trafficking. It may also 
be possible to use these GFP variants analogously to fluo- 
rescein and rhodamine to tag interacting proteins or subunits 
whose association could then be monitored dynamically in 
intact cells by fluorescence resonance energy transfer (19, 
20). Such fluorescence labeling via gene fusion would be 
site-specific and would eliminate the present need to purify 
and label proteins in vitro and microinject them into cells. 
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